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ABSTRACT 

With the rapid development of location-based social networks (LB- 
SNs), spatial item recommendation has become an important means 
to help people discover attractive and interesting venues and events, 
especially when users travel out of town. However, this recom¬ 
mendation is very challenging compared to the traditional recom- 
mender systems. A user can visit only a limited number of spatial 
items, leading to a very sparse user-item matrix. Most of the items 
visited by a user are located within a short distance from where 
he/she lives, which makes it hard to recommend items when the 
user travels to a far away place. Moreover, user interests and be¬ 
havior patterns may vary dramatically across different geographi¬ 
cal regions. In light of this, we propose Geo-SAGE, a geographical 
sparse additive generative model for spatial item recommendation 
in this paper. Geo-SAGE considers both user personal interests 
and the preference of the crowd in the target region, by exploit¬ 
ing both the co-occurrence pattern of spatial items and the con¬ 
tent of spatial items. To further alleviate the data sparsity issue, 
Geo-SAGE exploits the geographical correlation by smoothing the 
crowd’s preferences over a well-designed spatial index structure 
called spatial pyramid. We conduct extensive experiments to eval¬ 
uate the performance of our Geo-SAGE model on two real large- 
scale datasets. The experimental results clearly demonstrate our 
Geo-SAGE model outperforms the state-of-the-art in the two tasks 
of both out-of-town and home-town recommendations. 

1. INTRODUCTION 

The rapid development of Web 2.0, location acquisition and wire¬ 
less communication technologies have fostered a number of location- 
based social networks (LBSNs), such as Eoursquare, Gowalla, Eace- 
book Places and Loopt, where users can check-ins at venues and 
share life experiences in the physical world via mobile devices . 
On one hand, the new dimension of location implies extensive knowl¬ 
edge about an individual’s behavior and interests by bridging the 
gap between online social networks and the physical world, which 
enables us to better understand user preferences and design optimal 
recommendation systems. On the other hand, it is valuable to de¬ 
velop the location recommendation service as an essential function 
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to LBSNs to encourage users to explore new locations ||9l. There¬ 
fore, developing recommendation systems for LBSNs to provide 
users with spatial items (e.g., a venue or an event associated with 
a geographic location) has recently attracted increasing research 
attention Emu. This application becomes more important and 
useful when a user travels to an unfamiliar area, where he/she has 
little knowledge about the neighbourhood. In this scenario, the rec- 
ommender system is proposed as recommendation for out-of-town 
users in LBSNs in ||9l. In this paper, we focus on the problem of 
spatial item recommendation, aiming to offer accurate recommen¬ 
dations for both home-town and out-of-town users by mining their 
historical behavior data in LBSNs. 

Spatial item recommendation is a highly challenging problem 
because of the following three main reasons. 1. Data Sparsity. 
While LBSNs expand rapidly, the number of spatial items visited 
by an individual user is rather small compared to the total number 
of spatial items in a LBSN, which results in a very sparse user- 
spatial item matrix. This issue plagues most of the existing col¬ 
laborative filtering recommender systems. 2. Travel Locality. As 
observed in HU, users tend to visit spatial items that are nearby to 
their home locations. Eor example, an analysis of Eoursquare data 
shows that 45% of users travel 10 miles or less, while 75% travel 
50 miles or less HU. Travel locality makes the out-of-town recom¬ 
mendation more challenging. On one hand, there may be few or 
no historical activity records of the target user around the current 
location. On the other hand, similar users to the target user (that is, 
those who exhibit similar spatial item visiting behaviors) may also 
have not visited spatial items near the current location. Again, col¬ 
laborative filtering based approaches will be incapable of provid¬ 
ing effective recommendations. 3. Interest Drift Across Regions. 
When a user travels to a different region, her interests or behavior 
patterns may change. Through Eoursquare APfl we extract the top 
three categories of check-in POIs (points of interest) in three dif¬ 
ferent cities including Boston, Las Vegas and the Gold Coast for a 
group of users. The group size is 3000 and each user of this group 
has check-in records in all three cities. The percentage of check-ins 
of each POI category is shown in Table [T] We observe that when 
users are in Las Vegas, they are more interested in visiting Casino 
(80.32%), Nightlife (10.61%) and Outlet (5.82%), while the same 
users prefer Beach (71.36%), Surf Spot(14.82%) and Theme Park 
(9.60%) when in Gold Coast. A good spatial item recommender 
system should effectively model user interest drift and take it into 
account when making recommendations for out-of-town users. 

Recently, several methods ^ |9l [31] have been developed to 
make recommendations for both home-town and out-of-town users. 
As a matter of fact, these approaches either do not address all three 

^https://developer.foursquare.com/ 



City 

Top POI Types 

Percentage of Check-ins(%) 

Gold Coast (AU) 

Beach 

71.36% 

Surf Spot 

14.82% 

Theme Park 


Las Vegas (US) 

Casino 

80.32% 

Nightlife 

10.61% 

Outlet 


Boston (US) 

College 

78.32% 

Museum 

535^ 

Park 

7:55^ 


Table 1: Illustration of User Interest Drift 


challenges for spatial item recommendation, or address the chal¬ 
lenges with ineffective strategies. For example, Ference et al. ||9l 
proposed a CF-based method that considers both users who have 
visited many common spatial items with the target user and friends 
mined from the target user’s social connection in LBSNs. Including 
visiting records of social friends is supposed to handle data sparsity, 
and also the travel locality. For example, when similar users can¬ 
not provide effective clues for out-of-town recommendations, the 
visiting records of social friends will be used. Nonetheless, ac¬ 
cording to the survey IT), when the user travels more than 100km, 
the check-in probability at the same place visited by any of the so¬ 
cial ties, is merely 10%. Bao et al. m presented a recommender 
system considering both user personal interests and opinions from 
local experts. They model individual user’s interests based on the 
category information of spatial items, which helps overcome data 
sparsity to some extend. The opinions of local experts are used to 
solve the travel locality problem. Unfortunately, since the chosen 
local experts are users who share same/similar interests with the 
target user, this approach cannot address interest drift. 

In this paper, we propose Geo-SAGE, a geographical sparse ad¬ 
ditive generative model for spatial item recommendation. Tradi¬ 
tional mixture models, such as ED. consider multiple facets (e.g., 
time and location) that influence user choosing of spatial items by 
introducing additional latent variables, acting as “switches”, to con¬ 
trol which facet is currently active. As a matter of fact, it is not only 
computationally expensive to learn personalized “switching” vari¬ 
ables for individual users but also difficult to learn these variables 
accurately given sparse datasets. Inspired by the the Sparse Addi¬ 
tive Generative model (SAGE) El, we have designed our similar 
model by adding the effect of all the facets in the exponential space 
to avoid the inference of the latent “switching” variables, aiming to 
achieve improved robustness and predictive accuracy. 

Basically, to model user visiting behaviors, Geo-SAGE takes 
into account both user interests and interest drift. Geo-SAGE learns 
user interests as distributions over a set of latent topics, by mining 
both the co-occurrence patterns of spatial items and their content 
information (e.g., tags and categories). Exploiting content infor¬ 
mation of spatial items not only alleviates the data sparsity issue 
but also addresses the travel locality for out-of-town recommenda¬ 
tion. The content of spatial items serves as the medium to transfer 
user interests learned from home town to unfamiliar regions. 

To adapt to interest drift across regions, Geo-SAGE recognizes 
two roles of an individual user in a spatial region: a local or a 
tourist. Given a location, visiting records from local users will be 
mined to learn native preference as a distribution over latent topics. 
Similarly, visiting records from tourists will be used to learn tourist 
preference. Users with the same role at a location are more likely 
to have similar preferences and behavior patterns. Thus, to rec¬ 
ommend spatial items to a target user u at location /, we consider 
not only u’s personal interests, but also the preference and visiting 
behaviors of the crowd who have the same role with u at location 1. 

However, by using the visiting history to model native preference 
and tourist preference, we face the data sparsity issue again, espe¬ 


cially when the target region is small. When there are not enough 
visiting records in the target region, the two variables cannot be 
inferred accurately. To further overcome the data sparsity, Geo- 
SAGE integrates the spatial index - spatial pyramid which is a tree 
structure proposed in ClI. It is first constructed by partitioning lo¬ 
cations of spatial items into spatial grids of varying sizes at different 
hierarchies. Then, Geo-SAGE applies the additive framework [l] 
M to learn native preference and tourist preference of each region. 
Briefly, when learning the native preference and tourist preference 
in a region, the two variables learned for all of the region’s ancestor 
grids in the spatial pyramid will be added. By doing this, if there 
are few or no activities in a region, we can still infer its native pref¬ 
erence and tourist preference guided by its ancestors. By integrat¬ 
ing with the spatial pyramid, Geo-SAGE gains another advantage 
by allowing users to switch between different scales of geo-regions 
(e.g., zoom in/out on a Google Map) without re-learning the pa¬ 
rameters. In this way, this model can be connected with Google 
Map seamlessly. 

The main contributions of our work are summarized as follows. 

• We design a novel geographical sparse additive generative 
model (Geo-SAGE) for spatial item recommendation which 
incorporates and exploits item content information and the 
crowd’s preference at a region to address the problems of 
data sparsity, travel locality and interest drift across regions. 

• We reflne the crowd’s preference at a region by distinguish¬ 
ing native preference from tourist preference, based on which 
Geo-SAGE exploits the geographical correlation by smooth¬ 
ing these preferences over a well-designed spatial index struc¬ 
ture - spatial pyramid, to further alleviate the data sparsity . 

• We conduct extensive experiments to evaluate the perfor¬ 
mance of the proposed Geo-SAGE on two real large-scale 
datasets. The experimental results clearly demonstrate that 
Geo-SAGE outperforms the state-of-the-art in the tasks of 
both out-of-town and home-town recommendations 

This paper is organized as follows: Section 2 provides prelim¬ 
inary concepts about the SAGE model. Section 3 formulates our 
spatial item recommendation problem and describes the Geo-SAGE 
model. Section 4 discusses the experimental evaluation of Geo- 
SAGE. Existing research related to our work is surveyed in Section 
5. Section 6 closes this paper with some conclusive remarks. 

2. PRELIMINARIES ABOUT SAGE 

Our model is inspired by the Sparse Additive Generative Model 
(SAGE) im, which has been shown to be effective generative mod¬ 
els without explicit switching variables miiiiia. The basic idea 
of the model is that, if a variable is affected by several components, 
it can be generated by the mixture of these components without 
any explicit indicator variable. The key difference to traditional 
mixture models is that the mixture occurs in terms of natural pa¬ 
rameters of the exponential family instead of distributions. Such a 
model is robust given limited training data as it does not have to 
infer a complex indicator variable distinguishing the set of causes. 

To provide a clearer explanation of SAGE, we use a traditional 
probabilistic mixture generative model, LCA-LDA JST], as an il¬ 
lustrative example. LCA-LDA, is a location-content-aware model 
that aims to mimic the process of human decision making on spa¬ 
tial items. The model considers the user’s personal interest 
and the influence of local preference (note that it does not 

distinguish between native preference and tourist preference) in a 
unifled manner, and automatically leverages the effect of the two 
factors. Speciflcally, given a querying user u at a target location /, 














the likelihood that user u will prefer item v is computed by com¬ 
bining these two factors through a linear combination of them as 
follows. 

= XuP{v\ei^^^) + (1 - ( 1 ) 


where An is the “switching” variable that needs to be inferred for 
each user. Obviously, it cannot be inferred accurately when the 
training data for the individual user is sparse. In contrast, SAGE 
combines the two generative facets through simple addition in ex¬ 
ponential space as shown in Equation |2l Clearly, it avoids the need 
for latent switching variables. 






3. GEOGRAPHICAL SAGE MODEL 


( 2 ) 


In this section, we first formulate the problem definition, and 
then present our proposed geographical SAGE model (Geo-SAGE). 


3.1 Problem Definitions 

Eor ease of presentation, we define the key data structures and 
notations used in this paper. Table |2] also lists them. 

Definition 1. (Spatial Item) A spatial item is an item associated 
with a geographical location (e.g., a restaurant or a cinema). 

In our model, a spatial item has three attributes: identifier, loca¬ 
tion and contents. We use v to represent a spatial item identifier, ly 
to denote its corresponding location identifier and Wy to represent 
the set of words describing the semantic information of the item 
(e.g., tags and categories). POI is a kind of spatial item. The lo¬ 
cation information available for each spatial item v in the collected 
raw datasets is in the form of the (latitude, longitude) pair. Then a 
spatial pyramid structure |[T7ll24l . is applied to partition and index 
the entire geographic area. The granularities depend on the nature 
of the application and range from cities to streets. The details of 
the spatial pyramid are described in subsection 3.4. 

Definition 2. (User Home Location) Eollowing the recent work 
of (m, given a user u, we define the user’s home location as the 
place where the user lives, denoted as lu- 

Note that, we assume a user’s home location is “permanent” in 
our problem. In other words, a home location is a static location 
instead of a real-time location that is “temporally” related to him 
(e.g., the places where she/he is traveling). Due to privacy, user 
home locations are not always available. For a user whose home 
location is not explicitly given, we adopt the method similar to ia 
by inferring the user’s home location as the cell in the spatial pyra¬ 
mid with the most of his/her check-ins. 

Definition 3. (User Activity) A user activity is made of a five 
tuple (u, u, Wy, s) which indicates that the user u visits the 
spatial item v, located at ly and described as Wy, in the role of s. 
If s = 0, the user is recognized as a local and the activity occurs in 
u’s home town. If s = 1, the user u plays the role of tourist when 
visiting v. 

Definition 4. (User Profile) For each user u, we create a user 
profile Du, which is a set of user activities associated with u. The 
dataset D used in our model consists of user profiles, that is, D = 
{Du :ueU}. 

Then, given a dataset D as the union of a collection of user 
profiles, we aim to provide spatial item recommendation for both 
home-town and out-of-town users. We formulate our problem that 
takes into account both of the two scenarios in a unified fashion as 
follows. 


Variable 

Interpretation 

U,R,V 

the set of users, locations and spatial items 

W 

the vocabulary set 

Du 

the profile of user u 

'^u,i 

the spatial item of record in Du 

lu,i 

the location of spatial item Vu,i 

lu 

the home location of the user u 

Wu,i 

the set of words describing spatial item Vu^i 


the content word describing spatial item Vu,i 


if the user n is a local or tourist on location lu,i 


Table 2: Notations of The Input Data 

Problem 1. ( Spatial Item Recommendation ) Given a user ac¬ 
tivity dataset D and a target user u with his/her current location 
I (that is, the query is q = (u, /)), our goal is to recommend a 
list of spatial items that u would be interested in. Given a distance 
threshold d, the problem becomes an out-of-town recommenda¬ 
tion if the distance between the target user’s current location and 
his/her home location (that is, \l — lu\) is greater than d. Otherwise, 
the problem is a home-town recommendation. 

Following related studies Ena, we set d — lOO/cm in our 
work, since a distance around 100/cm is the typical radius of human 
“reach” — it takes about 1 to 2 hours to drive such a distance. 

з. 2 Model Description 

To model user activities, we propose a geographical sparse addi¬ 
tive generative model (Geo-SAGE). Figure [T] shows the graphical 
representation of Geo-SAGE. We first introduce the notations of 
our model, which are listed in Table 0 Our input data, that is, 
users’ activity profiles, are modeled as observed random variables, 
shown as shaded circles in Figure[T] Similar to existing models ifTdl 
ED, the topic index of each user activity is considered as a latent 
random variable, which is denoted as z. 

Intuitively, a user chooses a spatial item at a given location by 
matching his/her personal interests with the content of that item. 
Inspired by the early work on user interest modeling ifTdll^lSTl . 
Geo-SAGE also adopts latent topics to characterize users’ interests. 
Specifically, we infer an individual user’s interest distribution over 
a set of topics according to his/her visited spatial items and their as¬ 
sociated contents, denoted as . Thus, our model alleviates the 
data sparsity, especially for the out-of-town recommendation, as 
the content of spatial items plays the role of medium through which 
user interests inferred from their home town can be transferred to 
out-of-town regions. Besides, we also introduce a background dis¬ 
tribution over topics 0^ to capture common topics among all users. 
The purpose of using a background model 0^ is to make the user 
interests learned from the dataset more discriminative. 

To adapt to user interest drift across regions, we exploit the pref¬ 
erence of the crowds who share the same role with the target user 

и. For example, the preference of the tourists will be leveraged if 

the target user is currently out-of-town. Technically, we introduce 
two parameters: native preference and tourist preference. Given a 
location I, the native preference represents the preference of people 
living at location I, denoted as in contrast, the tourist pref¬ 

erence represents the preference of tourists travelling in location I, 
denoted as Both and are represented by 

distributions over topics. Note that, distinguishing native prefer¬ 
ence from tourist preference is one of the fundamental differences 
between our model and the LCA-LDA model ED which also ex¬ 
ploits the local activity records at the target location. 

To take full advantage of the strengths of both content-based and 
collaborative filtering-based recommendation methods, a topic z 
in our Geo-SAGE model is not only associated with a word dis¬ 
tribution but also a distribution over spatial items 















Variable 

Interpretation 

K 

the number of topics 

Zu,i 

the topic assigned to spatial item Vu,i 


the topic distribution of the background 

nuser 

the topic distribution, representing the intrinsic interest 
of user u 

Qnative 

the topic distribution of 1 , representing the native 
preference at 1 

Qtourist 

the topic distribution of 1, representing the tourist 
preference at 1 


the word distribution of the background 


the spatial item distribution of the background 


content word distribution of topic 2 ; 

o 

o' 

spatial item distribution of topic 2 ; 


Table 3: Notations of Model Parameters 


This design enables and to be mutually influenced 

and enhanced during the topic discovery process by associating 
them. Thus, the discovered topic z, on one hand, can cluster the 
content-similar items together. On the other hand, it can also cap¬ 
ture the item co-occurrence patterns to link relevant items together, 
similar to item-based collaborative Altering methods. Besides, we 
also introduce two background models for words and items, respec¬ 
tively: 0° and The purpose of using background models is to 
make the topics learned from the dataset more discriminative, since 
0° and assign high probabilities to non-discriminative and non- 
inf ormative words and items. 

In summary, our Geo-SAGE model aims to mimic the process of 
human decision making on visiting spatial items. Geo-SAGE as¬ 
sumes that users’ activities are influenced by three factors: 1) the 
user’s intrinsic interests , 2) the tourists’ preference 
if the user is a tourist; or the natives’ preference if the user 

is a local, 3) the general public’s preference 0^. All these factors 
are represented by a distribution over a set of topics, and influence 
the user’s decision-making through generating a topic z. How¬ 
ever, different from traditional mixture models ED which have to 
consider the multiple factors by introducing additional “switching” 
variables, our Geo-SAGE model adds the influence of the three fac¬ 
tors in the exponential space, which is more efficient and robust by 
avoiding inferring “switching” variables. 

The generative process of the Geo-SAGE model for an individual 
user activity in the user profile Du is as follows. 


Draw a topic index Zu,i 

-V I r, ^0 nuser nnative ntourist\ 

Zu,i ^ ) 

Eor each content word Wu,i,ni^ Wu,i, draw 


P{Wu,i,ri 


±topic\ 


• Draw a spatial item Vu,i 
Vu,i ~ P{Vu,i\lp°, 

Eor each user activity, Geo-SAGE first chooses the topic this ac¬ 
tivity is about. To generate the topic index z, we utilize a multino¬ 
mial model as expressed in Equation [3] 

P{Zu,i\u, Su,i, Qnative^ gtourist^^ 

=p{zuAe° + C'" + (1 - su,i) X er*”® + su,i X e*""”**) 

where 0^ is the global distribution over topics. is a user- 

dependent distribution over topics which represents the interests 
of u. s is an observable variable which only has two values: 0 
and 1. If s equals to 0, u is a local at I and otherwise a tourist. 
Qnatzve jg^otcs the uativc preference while denotes the 

tourist preference at /, both of which are distributions over topics. 



Figure 1: The Graphical Representation of Our Model 

P{z\0^ + + (1 - s) X + s X denoted as 

OLu,s,i,z, can be computed as in Equation [6| Once the topic z is 
generated, the spatial item v and the associated content words are 
generated as expressed in Equations |4| and [S] respectively. 

= P(vuAA° (4) 
P{wu,i,A‘l>° ^ = P{y^u,i,A<l>° + ) (5) 

where + = ')z,v and + = Az.w are 

computed as in Equation|6l Note that in order to model topics based 
on the background word/item distributions, for each topic, Geo- 
SAGE models the difference from the background word/item dis¬ 
tribution in log-frequencies, instead of the frequencies themselves. 

3.3 Model Inference 

Our goal is to learn parameters that maximize the marginal log- 
likelihood of the observed random variables w, v and s, and the 
marginalization is performed with respect to the latent random vari¬ 
able z. However, it is difficult to be maximized directly. Therefore, 
we apply a mixture between EM and a Monte Carlo sampler, called 
Gibbs EM algorithm 1281 . to maximize the complete data likeli¬ 
hood in Equation [T] where 0 is the set of all the parameters. In the 
E-step, we sample latent topic assignments by fixing all other pa¬ 
rameters using Gibbs sampling. In the M-step, we optimize model 
parameters 0 by fixing all topic assignments. The two steps are 
iterated until convergence. 

More specifically, we iteratively draw latent topic z for all activ¬ 
ities in the E-step. According to the Gibbs Sampling, when sam¬ 
pling as expressed in EquationfS] we assume all other variables 
are fixed. z^u,i represents the topic assignments for all user activi¬ 
ties except the z’th activity for user u. 


P{Zu^i\z^u,i, W, V, S, 0) 

n=l 


( 8 ) 




In the M-step, we optimize the parameters 0 to maximize the 
log likelihood of the objective function with all topic assignments 
fixed. To update the parameters, we use the gradient descent learn¬ 
ing algorithm PSSG (Projected Scaled Sub-Gradient) 1^ . which is 
designed to solve optimization problems with LI regularization on 




























*^u,s,l,z 


exp{el + + (1 - s) X 6>[;f + s X 


exp{ci,l + ct>Xl'^) 


expi^l + 


ea: 


+ (1 - ^) X + . X e\o^pst^ ’ exp{<i>l^ + 0*%^) ’ E.. ea:p(V^o + 

P(z, w, v|0, s, u) = P(z|s, u, 0°,(/)^°^"^)P(v|z, 


\u\ \Du\ 

HR' 


\U\ \Du\ l^^^6,^ I 

n n n 


yi yi iX XX XX 

u = l i = l n = l u = l i = l 


\u\ \Du\ 

n n^ 


( 6 ) 


(V) 


parameters. More importantly, PSSG is scalable because it uses the 
quasi-Newton strategy with line search that is robust to common 
functions. Let L be the log-likelihood of the model. According 
to the limited-memory BFGS t22l updates for the quasi-Newton 
method, the gradients of model parameters 9°, and 

gtourist pj-Qvided as follows. 


0 tourist 

ll / 


Entire system area (Level 1) 


0 tourist , 

i2 / 


2^ 


xz: 




2*2 grid (Level 2) 


4*4 grid (Level 3) 


dL 


|P| IPt.1 


= d{z) - y y' 
a6>o y y 

^ U=1 1=1 


^u,Su^i du,ii^ 


dL 


\Du\ 

= d{u,z) - y CXu,s^ 
i=l 


du i,z 


dL 


|P(0I 


QQnative 


= (1 - s) X id{l, z)- 


^Uj ,Sj 


,l,z) 


dL 


QQtourist 


i=i 

|P(0I 

= sx(d(l,z)- y 

J=1 


(9) 


( 10 ) 


( 11 ) 


( 12 ) 


where d{z) is the number of activities assigned to topic z, d{u, z) 
represents how many activities are assigned to topic z in Du, d{l, z) 
denotes the number of activities assigned to topic z at location I, 
and D{1) is the set of activities occurring at the location 1. 

Similarly, the gradients of model parameters 0°, and 

^topic computed as follows: 


dL ^ 

= d{w) - y d{z) X dz^n 

0(Pw ^ = 1 


dL 


/ topic 

d(pz,w 


= d{z, w) — d(z) X I3z,w 


dL 

dL 


= d{v) - y d{z) X -fz,v 


= d(z,v) — d(z) X jz,v 


(13) 


(14) 


(15) 


(16) 


where d{w) is the number of activities where the word w appears, 
and d{z,w) is the number of activities where the word w is as¬ 
signed to the topic z. d{v) is the number of activities associated 
with item u, and d{z,v) represents the number of activities in which 
topic z is assigned to item v. 

3.4 Spatial Smoothing 

To combat data sparsity when modelling the native preference 
and tourist preference, we adopt a tree structure, called spatial pyra¬ 
mid, to partition and index the entire geographic area. The spatial 
pyramid is proposed in 03, which is constructed by partitioning 
item locations into spatial regions of varying sizes at different hi¬ 
erarchies. More specifically, the spatial pyramid decomposes the 
space into H levels. The level 0 has only one grid cell. For a given 
level h, the space is partitioned into 4^ grid cells of equal area. 
Thus, the space can be divided recursively into numerous cells at 


8*8 grid (Level 4) 



0,: 


Figure 2: The Spatial Pyramid 

different levels with different granularity. The graphical represen¬ 
tation of the spatial pyramid is illustrated in Figure |2l 

One of the fundamental assumptions in spatial data mining is 
that everything is related to everything else but nearby things are 
more related than distant things, which is proposed in oni as the 
first law of geography 113. This law is known as the “spatial au¬ 
tocorrelation”. The spatial pyramid structure can encode this law 
in an effective manner. That is, for each location /, it can be repre¬ 
sented by a path from the root node to its corresponding leaf node. 
We use a vector to describe the path, that is, (h, I 2 ,..., Ih, • • •, 
Ih)- Based on the vector representation of a location, we can easily 
compute the proximity between two locations. For example, if two 
locations in the spatial pyramid share more ancestors, then these 
two locations are more proximate. 

When user activity data at a location I is very sparse, both the 
native preference 


not be estimated accurately. To address this issue, we exploit ge¬ 
ographical correlation to enhance the prior knowledge about the 
model parameters and Intuitively, if two locations 

I and I' are proximate in the geographical space, then their local 


preferences 0i 


and 6>r 


should be similar to each other. 


The intuition can be applied to the tourist preferences and 

Qtourist sij^parly. To integrate the information of geographical cor¬ 
relation into our Geo-SAGE model, we apply the additive frame¬ 
work (msi to compute native preference and tourist pref¬ 
erence at location I based on the path vector representation. 

Specifically, given a location /, its native preference and tourist 
preference are represented as follows. 




^native 

Ih. 


gtourist 


= E' 


)tourist 

^Ih. 


(17) 
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According to the above equations, both the native preference and 





















the tourist preference of a location depend on all of its ancestors up 
to the root. This representation method enables neighboring loca¬ 
tions to share a similar preference as desired, i.e., the preferences 
are smoothed over the spatial pyramid. Meanwhile, if there are few 
or no activities at a location, we can still infer its preference guided 
by its ancestors. Besides, once native preference and tourist pref¬ 
erence for each level are learned, this modeling makes the switch 
over among various granularity fast and convenient by changing the 
lowest level in the model without re-training the parameters. 

In the M-step of the model inference, we optimize the native 
preference and tourist preference using PSSG. Because the two pa¬ 
rameters are computed as the sum of the corresponding parameters 
at each level, we need to calculate the gradients of the parameters 
for each level. Thus, we extend Equations fTTI and \V2\ which infer 
native preference and tourist preference respectively, as follows. 


dL 

Qgnative = (1 “ «) X (dfa, ^ 

j = l 


(19) 


QQtourist ^ ^ ^ V ^Uj,s,lj,z) 

j = l 


( 20 ) 


where d{lh,z) denotes the number of activity records at the lo¬ 
cation Ih assigned to topic z, D{lh) is the set of activity records 
occurring at the location Ih, and Uj denotes the user who generates 
the j-th activity record. 


3.5 Spatial Item Recommendation 

Our proposed Geo-S AGE model is employed to recommend spa¬ 
tial items as follows. Given a user u and his/her target location /, 
our task is to recommend top-A; spatial items that u wish to visit 
from the items that the user has not visited before. More precisely, 
given the user and his/her target location, we first compute the in¬ 
dicator s according to the distance between u’s home location and 
1. After that, for each spatial item v located at I which has not 
been visited by u, we compute the probability that user u visits v 
as follows. 
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where Wv denotes the content words describing item v. We adopt 
the geometric mean for the probability of topic generating the 
word set Wy, considering that the number of words associated with 
different spatial items may be different. 

By building a tree structure for the space, our model supports 
users to switch between different scales of the geo-region conve¬ 
niently (e.g., zoom in/out on Google Map IS), by switching be¬ 
tween different levels in the tree structure. Although previous work 
also supports the change to granularity i3n . it needs to re-train all 
model parameters for the new granularity while our model does 
not. 


4.1 Experimental Settings 

4.1.1 DataSets 

We perform experiments on two real large-scale LBSNs datasets, 
Eoursquare and Twitter. 

Foursquare. This dataset contains the check-in history of 4,163 
users who live in California, USA. Eor each user, it contains the so¬ 
cial networks, home location, check-in venue identifiers, location 
of each venue in terms of latitude and longitude, and the content of 
each check-in venue. The total number of check-ins in this dataset 
is 483,813. Each check-in is stored as user-ID, venue-ID, venue- 
location, venue-content, user-role. Each record in social networks 
is stored as userlD, friendID and the total number of social rela¬ 
tionship is 32,512. This dataset is publicly available 0. The distri¬ 
bution of the check-in activities is described in Eigure |3(a)| Erom 
the distribution, we can see that, for the users who live in Califor¬ 
nia, although most of the check-ins occur in California, there is still 
a number of check-ins occurring in other states. This is an evidence 
of the significance of out-of-town recommendation. 

Twitter. This dataset is based on the publicly available twit¬ 
ter dataset Twitter supports third-party location sharing ser¬ 
vices like Eoursquare and Gowalla, where users of these services 
opt-in to share their check-ins on Twitter. But the original dataset 
does not contain the category or tag information about venues. So, 
we crawled the category and tag information associated with each 
venue from Eoursquare with the help of its publicly available API0 
The enhanced dataset contains 114,058 users and 1434,668 check¬ 
in activities. Each check-in record has the same format with the 
above Eoursquare dataset. Nonetheless, this dataset does not con¬ 
tain user social network information. Eigure [3(b^ describes the dis¬ 
tribution of the check-in activities across the USA. We observe that 
18.22% of check-in activities occur in California, and there are less 
than 7% of activity records occurring in each of the other states. As 
the home locations of users in this dataset are not explicitly given, 
we adopt the method in (m by inferring the user’s home location 
as the cell in the spatial pyramid with the most check-ins. 

To make the experiments repeatable, we make the Twitter dataset 
and our code publicly available 0 

4.1.2 Comparative Approaches 

We compare our Geo-SAGE model with the following three meth¬ 
ods representing the state-of-the-art spatial item recommendation 
techniques. 

LCA-LDA. LCA-LDA is a location-content-aware recommender 
model which is developed to support spatial item recommendation 
for users traveling in new cities ED- This model takes into ac¬ 
count both personal interests and local preferences of each city 
by exploiting both item co-visiting patterns and content of spa¬ 
tial items. Compared with Geo-SAGE, LCA-LDA is a traditional 
mixture model which introduces “switching” variables to consider 
multiple factors. Besides, LCA-LDA ignores the roles of users and 
does not distinguish tourist preference from native preference. 

UPS-CF. UPS-CE, proposed in ||9l, is a collaborative recom¬ 
mendation framework which is especially designed for out-of-town 
users. This framework integrates user-based collaborative filter¬ 
ing and social-based collaborative filtering. That is, it recommends 
POIs to a target user according to the activity records of both his/her 
friends and similar users. 

CKNN. CKNN O projects a user’s activity history into the cat- 


4. EXPERIMENTS 

In this section, we first describe the settings of experiments and 
then demonstrate the experimental results. 


^http://www.public.asu.edu/ hgaol 6/dataset, html 
^ http s: //developer, foursquare. com/ 

"^http ://net.pku. edu. cn/daim/yinhongzhi/index.html 







(a) Foursquare (b) Twitter 

Figure 3: Distribution of User Check-in Activities 


egory space and models user preference using a weighted category 
hierarchy. When receiving a query, CKNN retrieves all users and 
items located in the querying location, formulates a user-item ma¬ 
trix online, and then applies a user-based CF method to predict the 
rating of a querying user on an unvisited item. Note that the simi¬ 
larity between two users in CKNN is computed according to their 
weights in the category hierarchy, making CKNN a hybrid recom¬ 
mendation method. 

To further validate the benefits brought by exploiting the native 
preference or the tourist preference, distinguishing between tourist 
preference and native preference, and spatial smoothing based on 
the spatial pyramid, respectively, we implement three variant ver¬ 
sions of our Geo-SAGE model. 

Geo-SAGE-SI is the simplified version of Geo-SAGE model 
which does not consider the native preference or tourist preference. 
Thus, this model cannot adapt to user interest drift. In this model, 
OL in Equation [6] is reformulated as: 

Geo-SAGE-S2 is the simplified version of Geo-SAGE where we 
ignore the roles of users and do not distinguish between tourist pref¬ 
erence and native preference, similar to LCA-LDA im. In this 
model, OL in Equation [6]is reformulated as: 

au,i,z - ^ ^ 

Geo-SAGE-S3 is the simplified version of Geo-SAGE which 
does not exploit the geographical correlation in the spatial pyra¬ 
mid. Thus, the inferred native preference and tourist preference 
for location I are not reliable when there are few or even no user 
activity records. 

4.1.3 Evaluation Methods 

Since our Geo-SAGE model is designed for both home-town rec¬ 
ommendation and out-of-town recommendation, we evaluate the 
recommendation effectiveness of our model under the two scenar¬ 
ios respectively. Given a user profile in terms of a collection of 
user activities, we divide the user’s activities into a training set and 
a test set. For the scenario of home-town recommendation, we ran¬ 
domly select 30% of the activity records occurring at the user’s 
home town as test set, and use the remaining activity records as the 
training set. Similarly, for the scenario of out-of-town recommen¬ 
dation, we randomly select 30% of the activity records generated 
by the user when he/she travels out of town as the test set, and use 
the remaining activity records as training set. To decide whether an 
activity record occurs when the user is in his/her home town or not, 
we measure the location distance between the user’s home town 
and the spatial item (e.g., \lu — If the distance is greater than 


100/cm, then we assume the activity occurs when the user is out- 
of-town. The threshold d = 100km is selected because a distance 
around 100/cm is the typical human radius of “reach”, which takes 
about 1 to 2 hours to drive. 

According to the above dividing strategies, we split the user ac¬ 
tivity dataset D into the training set Dtrain and the test set Dtest- 
To evaluate the recommendation methods, we adopt the evaluation 
methodology and measurement Recall @/c, which is applied in lEI 
[30][3T|. Specifically, for each user activity record (u, u, Wv, 
s) in Dtest • 

1) we compute the ranking score for spatial item v and all other 
spatial items which are within the circle of radius d centered at ly 
and unvisited by u previously; 2) we form a ranked list by ordering 
all of these spatial items according to their ranking scores. Let r 
denote the position of the spatial item v within this list. The best 
result corresponds to the case where v precedes all the unvisited 
spatial items (that is, r = 1); and 3) we form a top-k recommenda¬ 
tion list by picking the k top ranked spatial items from the list. If 
r < /c, we have a hit (i.e., the ground truth item v is recommended 
to the user). Otherwise, we have a miss. 

The computation of Recall@/c proceeds as follows. We define 
hii@k for a single test case as either the value 1, if the test item v 
appears in the top-k results, or the value 0, if otherwise. The overall 
Recall@/c is defined by averaging over all test cases: 

Recaimk = 

\Dtest\ 

where ij^hit^k denotes the number of hits in the test set, and 
\Dtest I is the number of all test cases. 

4.2 Recommendation Effectiveness 

In this part, we present experimental results of the comparing 
recommendation methods with well-tuned parameters. Figures |4] 
and [5] report the recommendation effectiveness on the Foursquare 
and Twitter datasets, respectively. From the figures, we observe 
that the recall values gradually rise with respect to the increase of k. 
This is because, by returning more spatial items, it is more likely to 
discover the ones that users would like to visit. Note that we show 
only the performance when /c is set to 2, 6,10,14,18. A greater 
value of k is usually ignored for the top-k recommendation task. 

It is apparent that the recommendation methods have significant 
performance disparity in terms of the top-k recall. Figure |4(a)| 
presents the recommendation accuracy in the scenario of out-of- 
town recommendation where the recall of Geo-SAGE is about 0.327 
when k — 2, and 0.535 when k — 10. This means that Geo-SAGE 
has a probability of 32.7% to place an appealing point of interest 
in the top-2 recommendations and 53.5% to place it in the top-10 
recommendations. Clearly, our proposed Geo-SAGE model out¬ 
performs other competitor models significantly, demonstrating the 
advantages of Geo-SAGE over other competitor methods. Several 
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Figure 4: Performance on Foursquare Dataset 


Figure 6: Impact of Different Factors on Foursquare Dataset 
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Figure 5: Performance on Twitter Dataset 

observations are made from the results: 1) Geo-SAGE and LCA- 
LDA perform much better than CKNN. This is because both of 
them exploit the crowd’s behaviors at the target region, which ef¬ 
fectively alleviates the problem of user interest drift, while CKNN 
does not consider the issue since the local experts discovered by 
CNKK have same or similar interests with target users. 2) UPS-CF 
drops behind Geo-SAGE and LCA-LDA, showing the advantages 
of using latent topic models to capture users’ interests by exploit¬ 
ing the content of their visited spatial items. Through the medium 
of content, Geo-SAGE and LCA-LDA transfer the users’ interests 
inferred at home town to out-of-town regions. In contrast, UPS-CF 
is a mixture of collaborative filtering and social filtering, which ig¬ 
nores the effect of content. Besides, according to the recent survey 
in m, for movement farther than 100km from home location, the 
probability of visiting the exact same location as a friend has vis¬ 
ited in the past is low. 3) Geo-SAGE achieves much higher recall 
value than LCA-LDA, showing the advantages of sparse additive 
modeling over the traditional mixture modeling. Additionally, the 
superiority of Geo-SAGE is partly due to the distinguishing be¬ 
tween the native preference and the tourist preference. 

Figure|5] shows the recommendation effectiveness on the Twitter 
dataset. As the social relationship is unavailable on this dataset, 
UPS-CF has not been evaluated on this dataset. The trend of the 
comparison result is similar to that presented in Figure |4| In both 
settings, Geo-SAGE performs the best, followed by LCA-LDA. 

There are two parameters in Geo-SAGE, namely, the height of 
spatial pyramid (H) and the number of topics (K). The experi¬ 
mental results presented above are obtained with the optimal pa¬ 
rameter settings: (1) the optimal height of spatial pyramid is 5 for 
both Foursquare and Twitter datasets; (2) the optimal values of K 
are 50 for the Foursquare dataset, and 100 for the Twitter dataset. 

4.3 Impact of Different Factors 

To validate the benefits brought by exploiting the native prefer¬ 
ence or the tourist preference, distinguishing between tourist pref¬ 
erence and native preference, as well as spatial smoothing based 
on the spatial pyramid, respectively, we compare Geo-SAGE with 
the three variant versions: Geo-SAGE-Sl, Geo-SAGE-S2 and Geo- 
SAGE-S3. The comparison results are shown in Figures [6| and 


Figure 7: Impact of Different Factors on Twitter Dataset 

[T] From the results, we observe that Geo-SAGE consistently out¬ 
performs the three variant versions for both out-of-town recom¬ 
mendation and home-town recommendation, indicating the benefits 
brought by each factor, respectively. For instance, the performance 
gap between Geo-SAGE and Geo-SAGE-S3 validates the effective¬ 
ness of exploiting the geographical correlation and smoothing both 
native and tourist preferences over the spatial pyramid to alleviate 
data sparsity. We also observe that Geo-SAGE-S2 and Geo-SAGE- 
S3 outperform Geo-SAGE-Sl consistently, showing the advantages 
brought by exploiting the crowd’s activity histories when individual 
user’s behavior data is sparse. Another observation is that the per¬ 
formance gap between our Geo-SAGE model and the three variants 
in the task of home-town recommendation is smaller than that in 
the task of out-of-town recommendation, showing that the perfor¬ 
mance differences among these four methods become less obvious 
when people travel in home town where the issue of data sparsity 
is not serious. 


Height 

Foursquare 

Twitter 

Re@2 

Re@10 

Re@20 

Re @2 

Re@10 

Re@20 

2 

0.282 

0.511 

0.616 

0.287 

0.459 

0.602 

3 

0.301 

0.519 

0.620 

0.292 

0.537 

0.615 

4 

0.319 

0.528 

0.629 

0.318 

0.559 

0.635 

5 

0.327 

0.535 

0.637 

0.353 

0.573 

0.653 

6 

0.296 

0.504 

0.605 

0.289 

0.555 

0.618 

7 

0.270 

0.465 

0.567 

0.256 

0.537 

0.582 


Table 4: Impact of Height of Spatial Pyramid 


To further study the effect of spatial pyramid in Geo-SAGE, we 
tried different parameter setups for the height of the spatial pyra¬ 
mid. Tuning this parameter is critical to the performance of this 
model. We tested the performance of Geo-SAGE model by varying 
the height of spatial pyramid from 1 to 7, and present the results 
in Table |4] Due to space constraint, we only show the experimen¬ 
tal results for the out-of-town recommendation. From the results, 
we observe that as the height of the spatial pyramid increases, the 
Recall values of Geo-SAGE first increase, and then decrease. One 
possible reason for early increasing Recall values is that increas¬ 
ing the height increases the exploitation of spatial effect and makes 
the inference of native and tourist preferences more precise. Later 
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(a) With Less than 5 Activities (b) With 0 Activities 

Figure 8: Recommendation for Cold-start Users 

on, Recall decreases as the height of spatial pyramid gets larger, 
because increasing the height makes users’ activity data in a re¬ 
gion cell sparser. Geo-SAGE achieves its best performance when 
the height of spatial pyramid is set to 5 on the two datasets, which 
could be a tradeoff of aforementioned two factors. 

4.4 Test for Cold Start Problem 

We further conduct experiments to study the effectiveness of dif¬ 
ferent recommendation algorithms handling the cold start problem. 
Cold start is a critical problem in the domain of recommendation, 
where users have not visited any spatial item, or have visited only 
a few items. We test the recommendation effectiveness for cold- 
start users on the Foursquare dataset and present the results in Fig¬ 
ure [8] Figure [Ha) reports the result for users with less than 5 activ¬ 
ity records, and Figure (Hb) reports the result for users without any 
activity record. 

By comparing Figures |4| and [8j we observe that, although the 
recommendation effectiveness of all algorithms decrease to vari¬ 
ous degrees, our proposed Geo-SAGE still performs the best. Note 
that it is very hard to capture cold-start users’ preference from 
extremely limited number of spatial items visited by target users. 
That is why CKNN performs worst by using user activity history 
only. Geo-SAGE, LCA-LDA and UPS-CF overcome the scarce of 
user activity history by exploiting the wisdom of the crowd and 
social friends, respectively. The behaviors of the crowd or social 
friends at the target region supply many valuable references and 
clues that are potentially useful for spatial item recommendations. 
Geo-SAGE outperforms LCA-LDA due to the fact that Geo-SAGE 
distinguishes the tourist preference from the native preference, and 
a user is more likely to visit the spatial items preferred by the crowd 
who have the same role as him/her at the target region. Note that all 
users’ home town information is known in the Foursquare dataset. 
Thus, for users with no activity records, Geo-SAGE can also iden¬ 
tify whether a user is a tourist or a local and then recommend the 
most popular items preferred by the tourists or the locals in the tar¬ 
get region. Another observation is that both Geo-SAGE and LCA- 
LDA outperform UPS-CF significantly. This is because UPS-CF 
is a memory-based method which suffers from severe data spar¬ 
sity, while Geo-SAGE and LCA-LDA are latent class models that 
can handle the data sparsity problem to a great extent by reducing 
dimension and integrating content information. 

Figure |9] illustrates the performance of Geo-SAGE for cold-start 
users in home-town and out-of-town recommendation, n denotes 
the maximum number of activity records that each cold-start user 
has in the training set. From the results, we observe that for cold- 
start users, Geo-SAGE achieves better performance in the task of 
out-of-town recommendation than in the task of home-town rec¬ 
ommendation. Besides, as n increases from 0 to 5, Geo-SAGE 
achieves a larger performance increase in home-town recommen¬ 
dation than in out-of-town recommendation. The results indicates 
that for cold-start users, the tourist preference plays a more impor- 
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Figure 9: Home-town and Out-of Town Recommendation for 
Cold-start Users 

tant role in improving out-of-town recommendation than the native 
preference does in improving home-town recommendation. This is 
because users’ visiting behaviors at home town are more personal¬ 
ized and diverse, while their out-of-town behaviors tend to be more 
consistent. That is, different users tend to visit the same spatial 
items when they travel in the same out-of-town region. On the other 
hand, the results indicate that users’ personal interests have more 
infiuence on home-town recommendation than on out-of-town rec¬ 
ommendation. That is why home-town recommendation is more 
sensitive to the number of user historical activities. 

5. RELATED WORK 

Spatial item recommendation, also called location or place rec¬ 
ommendation, has been considered as an essential task in the do¬ 
main of recommender system. Recently, with the easy access of 
large-scale user activity records in LBSNs, many recent work has 
tried to improve spatial recommendation by exploiting and integrat¬ 
ing geographical and social infiuence, temporal effect and content 
information of POIs. 

Geo-Social Influence. Geo-Social influence indicates that peo¬ 
ple tend to explore nearby POIs of a POI that they or their friends 
have visited before ||2^ . Many recent studies WMWM showed 
that there is a strong correlation between user check-in activities 
and geographical distance as well as social connections, so most 
of current POI recommendation work mainly focus on leveraging 
the geographical and social influences to improve recommendation 
accuracy. For example. Ye et al. 03 delved into POI recommenda¬ 
tion by investigating the geographical influences among locations 
and proposed a system that combines user preferences, social influ¬ 
ence and geographical influence. Cheng et al. n investigated the 
geographical influence through combining a multi-center Gaussian 
model, matrix factorization and social influence together for loca¬ 
tion recommendation. Lian et al. 03 incorporated spatial cluster¬ 
ing phenomenon resulted by geographical influence into a weighted 
matrix factorization framework to deal with the matrix sparsity. 
Ference et al. 13 designed a collaborative recommendation frame¬ 
work which considers the activity records generated by both friends 
and similar users in a mixture way. 

Temporal Effect. The temporal effect of user check-in activi¬ 
ties in LBSNs has also attracted much attention from researchers. 
POI recommendation with temporal effect mainly leverages tem¬ 
poral cyclic patterns and temporal sequential patterns on LBSNs. 
Gao et al. ifTTl investigated the temporal cyclic patterns of user 
check-ins in terms of temporal non-uniformness and temporal con¬ 
secutiveness. Yuan et al. also incorporated the temporal cyclic 
information into a user-based collaborative filtering framework for 
time-aware POI recommendation. Cheng et al. m introduced the 
task of successive personalized POI recommendation in LBSNs by 
embedding the temporal sequential patterns. 

Content Information. Most recently, researchers explored the 
content information of POIs to alleviate the problem of data spar- 









































sity. Hu et al. (H proposed a spatial topic model for POI rec¬ 
ommendation considering both spatial aspect and textual aspect of 
user posts from Twitter. Yin et al. ED exploited both personal in¬ 
terests and local preferences based on the contents associated with 
spatial items. Liu et al. ED Studied the effect of POI-associated 
tags for POI recommendation with an aggregated LDA and matrix 
factorization method. Gao et al. na studied both POI-associated 
contents and user sentiment information (e.g., user comments) into 
POI recommendation and reported their good performance. 

As described above, while there are many studies to alleviate the 
data sparsity by exploiting geographical-social influence, temporal 
effect and content information, they did not address the challenges 
arising from either travel locality or interest drift for the out-of- 
town recommendation. For example, most of the above work as¬ 
sumed that users are in their home towns, they did not consider 
user interest drift across regions. Our work in this paper distin¬ 
guishes from previous work in several points. First, to the best of 
our knowledge, we are the first to simultaneously address the three 
challenges in a unified model. Second, although both ED and Gol 
exploited the local crowd’s preferences, they ignored the user’s role 
and did not distinguish native preference from tourist preference. 
Third, we proposed a novel effective method to represent and infer 
both native and tourist preferences based on a well-designed spatial 
index structure. 

6. CONCLUSION AND FUTURE WORK 

In this paper, we proposed a geographical sparse additive genera¬ 
tive model, Geo-S AGE, for spatial item recommendation, which ef¬ 
fectively overcomes the challenges arising from data sparsity, travel 
locality and interest drift. Specifically, to combat data sparsity and 
travel locality, Geo-SAGE exploited both the co-occurrence pat¬ 
tern of spatial items and their content to infer and transfer user in¬ 
terests. To address interest drift, Geo-SAGE incorporated the na¬ 
tive or tourist preference at the target location. To alleviate the 
data sparsity confronted by the inference of native preference and 
tourist preference for each region, Geo-SAGE employed an ad¬ 
ditive framework to smooth the preferences over a well-designed 
spatial pyramid. We conducted extensive experiments to evalu¬ 
ate the performance of our Geo-SAGE model on two real large- 
scale datasets. The experimental results revealed the advantages 
of the Geo-SAGE model over other spatial item recommendation 
methods, for both home-town and out-of-town recommendations, 
demonstrating the effectiveness of Geo-SAGE in facilitating users 
to travel in their home towns as well as in regions they are not fa¬ 
miliar with. 

As a promising research direction, we would like to explore en¬ 
hancements to our model by integrating the social infiuence and 
temporal effect. 
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